首页> 外文OA文献 >Creating a reference data set for the summarization of discussion forum threads
【2h】

Creating a reference data set for the summarization of discussion forum threads

机译:创建参考数据集以汇总讨论论坛主题

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

In this paper we address extractive summarization of long threads in online discussion fora. We present an elaborate user evaluation study to determine human preferences in forum summarization and to create a reference data set. We showed long threads to ten different raters and asked them to create a summary by selecting the posts that they considered to be the most important for the thread. We study the agreement between human raters on the summarization task, and we show how multiple reference summaries can be combined to develop a successful model for automatic summarization. We found that although the inter-rater agreement for the summarization task was slight to fair, the automatic summarizer obtained reasonable results in terms of precision, recall, and ROUGE. Moreover, when human raters were asked to choose between the summary created by another human and the summary created by our model in a blind side-by-side comparison, they judged the model's summary equal to or better than the human summary in over half of the cases. This shows that even for a summarization task with low inter-rater agreement, a model can be trained that generates sensible summaries. In addition, we investigated the potential for personalized summarization. However, the results for the three raters involved in this experiment were inconclusive. We release the reference summaries as a publicly available dataset.
机译:在本文中,我们将讨论在线讨论论坛中长线程的摘要。我们提出了一项详尽的用户评估研究,以确定论坛摘要中的人员偏好并创建参考数据集。我们向十个不同的评分者显示了较长的主题,并要求他们通过选择他们认为对该主题最重要的帖子来创建摘要。我们研究了人类评分者之间关于汇总任务的协议,并且我们展示了如何将多个参考摘要进行组合以开发成功的自动汇总模型。我们发现,尽管汇总任务的评估者之间的协议有些微不足道,但自动汇总器在精度,召回率和ROUGE方面都取得了合理的结果。此外,当要求评估人在盲目并排比较中从另一个人创建的摘要和我们的模型创建的摘要之间进行选择时,他们判断该模型的摘要等于或优于人类摘要的一半。案件。这表明,即使对于评估者之间的共识度较低的摘要任务,也可以训练生成合理摘要的模型。此外,我们调查了个性化汇总的潜力。但是,该实验涉及的三个评估者的结果尚无定论。我们将参考摘要发布为可公开获得的数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号